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Introduction 

_ Occasionally technical developments in science occur that enable 

% vSS&it-: ~? • j;^ leaps forward in our knowledge and increase the potential tor 

^v^. 1 innovation. Molecular biology and biomedical research have 

recently experienced just such a revolutionary change with the 
development of gene manipulation. The term gene manipulation 
can be applied to a variety of sophisticated in-vivo genetics as 
well as to in-vitro techniques. In fact, in most Western countries 
there is a precise legal definition of gene manipulation as a result 
of Government legislation to control it. In the United Kingdom 
gene manipulation is defined as 'the formation of new combi- 
nations of heritable material by the insertion of nucleic acid 
molecules, produced by whatever means outside the cell into 
any virus, bacterial plasmid or other vector system so as to 
* r allow their incorporation into a host organism in which they 

; -ri do not naturally occur but in which they are capable of con- 

tinued propagation.' The definitions adopted by other countries 
are similar and all adequately describe the subject matter of 
~i /jSl^r*^ • this book. 

. This legal definition emphasizes the propagation of foreign 

>V nucleic acid molecules (the nucleic acid is nearly always DNA) 

v t?''X* - ■ in a different, host organism. The ability to cross natural species 

5* barriers and place genes from any organism in an unrelated 

* host organism is one important feature of gene manipulation. 
• ' A second important feature is the fact that a defined and 

. ^ relatively small piece of DNA is propagated in the host 
%^ organism. As we shall see, this has far-reaching consequences, 

J- " for it is then possible to obtain a pure DNA fragment in bulk. 

'/ This opens the door to a range of molecular biological 

'.y • opportunities including nucleotide sequence determination, 

site-directed mutagenesis, and manipulation of gene sequences 
to ensure very high level expression of an encoded polypeptide 
in a host organism. In addition, the DNA fragment provides a 
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molecular hybndization probe of absolute sequence puntv. 
totally uncontaminated by other sequences from the donor 
organism. 

The initial impetus tor gene manipulation :n vitro came about 
in the early 1970s with the simultaneous development of 
techniques for: 

1 transformation* of Escherichia coli, 

2 cutting and joining DNA molecules, and 

3 monitoring the cutting and joining reactions. 

In order to explain the significance of these developments 
we must first consider the essential requirements of a successful 
gene manipulation procedure. 

The basic problems 

Before the advent of modern gene manipulation methods there 
had been many early attempts at transforming pro- and 
eukaryotic cells with foreign DNA. But, in general, little 
progress could be made. The reasons for this are as follows. Let 
us assume that the exogenous DNA is taken up by the recipient 
cells. There are then two basic difficulties. First, where 
detection of uptake is dependent on gene expression failure 
could be due to lack of accurate transcription or translation. 
Second, and more importantly, the exogenous DNA may not be 
maintained in the transformed cells. If the exogenous DNA is 
integrated into the host genome, there is no problem. The exact 
mechanism whereby this integration occurs is not clear and 
it is usually a rare event. If the exogenous DNA fails to be 
integrated, it will probably be lost during subsequent multi- 
plication of the host cells. The reason for this is simple. In 
order to be replicated, DNA molecules must contain an on^m of 
replication and in bacteria and viruses there is usually only one 
per genome. Such molecules are called replicons. Fragments or 
DNA are not replicons and in the absence of replication will be 
diluted out of their host cells. It should be noted that even if a 
DNA molecule contains an origin of replication this may not 
function in a foreign host cell. 

There is an additional, subsequent problem. If the early 
experiments were to proceed, a method was required for 

'The sudden change of an animal cell possessing normal growth 
properties into one with many ot the growth properties of the cancer ceil 
is called grozvth transformation. Growth transformation is mentioned :n 
Chapter 11 and should not be confused with bacterial transformation 
which is described here. 
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assessing the fate of the donor DNA. In particular, in 
-ircumstances where the foreign DNA was maintained because 
it had become integrated in the host DNA, a method was 
required tor mapping the foreign DNA and the surrounding 
host sequences. 

THE SOLUTIONS: BASIC TECHNIQUES 

It fragments of DNA are not replicated, the obvious solution is 
to attach them to a suitable replicon. Such repltcons are known 
as vectors or cloning vehicles. Small plasmids and bacteriophages 
are the most suitable vectors for they are replicons in their 
own right, their maintenance does not necessarily require 
integration into the host genome and their DNA can be isolated 
readilv in an intact form. The different plasmids and phages 
which are used as vectors are described in detail in Chapters 3 
and 4. Suffice it to say at this point that initially plasmids and 
phages suitable as vectors were only found in Escherichia coli. 

Composite mpiefflj_es_in_which Joreign DNA _has been 
inserted into a vector molecule are somefimeT"called DNA 
chimaeras because of their analogy with" the" Chimaera of 
mythology— a creature with the head of a lion, body of a goat 
and the tail of a serpent. The construction of such compos i te, or 
artificial recombinant molecules h as also been termed g enetic 
engineering or ge ne man ipulation because of the potential for 
creati*ng"riovel genetic combinations by biochemical means. 
The process has also been termed molecular cloning or gen e 
cloning because a line of genetically identical organisms, all of 
"which contain the composite 'moIecuIeT can" be propagated and 
grown in bulk hence amplifying the composite molecule and any 
gene product whose synthesis it directs. 

Although conceptually very simple, cloning of a fragment of 
foreign, or passenger, or target DNA in a vector demands that 
the following can be accomplished. 

1 The vector DNA must be purified and cut open. 

2 The passenger DNA must be inserted into the vector 
molecule to create the artificial recombinant. DNA joining 
reactions must therefore be performed. Methods for cutting and 
joining DNA molecules are now so sophisticated that they 
warrant a chapter of their own (Chapter 2). 

3 fhe cutting and joining reactions must be readily 
monitored. This is achieved by the use of gel electrophoresis. 

4- Finally, the artificial recombinant must be transformed into 
£. coli, or other host ceil. Further details on the use of gel 
electrophoresis and transformation of E. coli are given in the 
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next section. As we have noted, the necessarv technics 
became available at about the same time and quicklv led ro 
many cloning experiments, the first of which were reported m 
1972 (Jackson et iL 1972, Lobban & Kaiser 1973). 

Agarose gel electrophoresis 

The progress of the first experiments on cutting and joinine of 
DNA molecules was monitored bv velocitv sedimentation in 
sucrose gradients. However, this has been entirelv superseded 
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Fig. 1.1 Electrophoresis of DNA in agarose gels. The direction of 
migration is indicated by the arrow. DNA bands have been visualised 
by soaking the gel in a solution of ethidium bromide i which complexes 
with DNA by intercalating between stacked base pairs) and 
photographing the orange fluorescence which results upon ultraviolet 
irradiation. (A) Phage A DNA restricted with Eco RI and then 
electrophoresed in a 1% agarose gel. The A restriction map is given in 
Fig. 4.4. (B) Open circular (OC) and super-coiled (SO forms of a 
piasmid of 6.4 kb pairs. Note that the compact super-coils migrate 
considerably faster than open circles. (O Linear piasmid (L) D\A 
produced by treatment of the preparation shown in lane B with Eco RI 
for which there is a single target site. Under the conditions or 
electrophoresis employed here, the linear form migrates just ahead or 
the open-circular form. 
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bv gel electrophoresis. Gel electrophoresis is not onlv used as 
an analytical method, it is routinely used preparatively tor the 
purification of specific DNA fragments. The gel is composed 
or polyacrylamide or agarose. Agarose is convenient for separ- 
ating DNA fragments ranging in size from a few hundred to 
about 20000 base pairs. Polyacrylamide is preferred for smaller 
ON* A fragments. 

A gel is a complex network of polymeric molecules. DNA 
molecules are negatively charged and under an electric held 
DNA molecules migrate through the gel at rates dependent 
upon their sizes: a small DNA molecule can thread its wav 
through the gel easily and hence migrates faster than a larger 
molecule (Fig. 1.1). Aaij and Borst (1972) showed that the 
migration rates of the DNA molecules were inversely pro- 
portional to the logarithms of the molecular weights. More 
recently, Southern (1979a,b) has shown that plotting fragment 
length or molecular weight against the reciprocal of mobility 
gives a straight line over a wider range than the semi- 
logarithmic plot. In any event, gel electrophoresis is frequently 
performed with marker DNA fragments of known size which 
allow accurate size determination of an unknown DNA mol- 
ecule by interpolation. A particular advantage of gel electro- 
phoresis is that the DNA bands can be readily detected at high 
sensitivity. The bands of DNA in the gel are stained with the 
intercalating dye ethidium bromide (Fig. 1.2), and as little 
as 0.05 Mg of DNA in one band can be detected as visible 
fluorescence when the gel is illuminated with ultraviolet light. 

In addition to resolving DNA fragments of different lengths, 
gel electrophoresis separates the different molecular configur- 
ations of a DNA molecule. The covalently closed circular, 
the nicked (relaxed) circular and linear forms of .a DNA mol- 
ecule have different mobilities (Fig. 1.1). Readers wishing to 



-e gets. The direction of 
mds have been visualised 
■ to m i de. ( vvh ich Complexes 
acked base_ pairs* and 
h result* uporrultraviottt 
wttk E« RI a jui4it€$ 
estdction mip.is given in 
:r-coiled (SO forms oi'i 
npact super-coils migrate 
Linear plasmid (L) DNA" 
own in lane.B with Eco Rl 
Under the conditions "of 
rm migrates just ahead qffc 




Fig. 1.2 Ethidium bromide. 
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know more about the factors affecting the electrophoretic 
mobility in agarose gels of the different conformational isomers 
of DNA should consult the paper bv Johnson and Grossman 
(1977V 

Southern blotting 

Frequently it is necessary to know what sequences in a DNA 
restriction fragment are transcribed into RNA, or to be able to 
map sequences by hybridization to restriction fragments. 
Clearly it would be helpful to have a method of detecting 
fragments in an agarose gel that are complementary to a given 
RNA or DNA sequence. This can be done by slicing the gel, 
eluting the DNA and hybridizing to radiolabeled 'probe' DNA 
or RNA either in solution, or after binding the restriction 
fragment to filters. This method, which is time consuming and 
inevitably leads to some loss of resolution, has now been 
replaced by a neat method described by Southern (1975, 1979b). 
This method, often referred to as Southern blotting is shown in 
Fig. 1.3. 

DNA restriction fragments in the gel are denatured by alkali 
treatment and the gel is then laid on top of buffer-saturated 
filter paper. The top surface of the gel is covered with a cellulose 
nitrate (often called nitrocellulose) filter and overlaid with dry 
filter paper. Buffer passes through the gel drawn by the dry filter 
paper, and carries the single-stranded DNA with it. When the 
DNA comes into contact with the cellulose nitrate it binds to it 
strongly. The DNA fragments can be permanently fixed to the 
cellulose nitrate by baking at 80 °C The filter can then be placed 
in a solution of radioactive RNA or denatured DNA which is 
complementary in sequence to the blot-transferred DNA. Condi- 
tions are chosen so that the radioactive nucleic acid hybridizes 
with complementary DNA on the cellulose nitrate. After a wash- 
ing step, the regions of hybridization can be detected autoradio- 
graphically by placing the cellulose nitrate in contact with photo- 
graphic film. The method is extremely sensitive and can even be 
used to map restriction sites around a single-copy gene sequence 

Sheets of dry filter caper 
Cellulose nitrate filter 
Gel containing denatured 
ONA 
Buffer 

Moist filter paper 
Fig. 1.3 The Southern blot' technique. See text for details. 
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Fig. 1--* Mapping restriction sites around a hypothetical gene se- 
quence in total genomic DNA by the Southern blot method. 

Genomic DNA is cleaved with a restriction endonuclease into hun- 
dreds of thousands of fragments of various sizes. The fragments are 
separated according to size by gel electrophoresis and blot-transferred 
on to cellulose nitrate paper. Highly radioactive RNA or denatured DNA 
complementary in sequence to gene X is applied to the cellulose nitrate 
paper bearing the blotted DNA. The radiolabeled RNA or DNA will 
hvbridize with gene X sequences and can be detected subsequently by 
autoradiography, so enabling the sizes of restriction fragments 
containing gene X sequences to be estimated from their electrophoretic 
mobility. By using several restriction endonucleases singly and in 
combination, a map of restriction sites in and around gene X can be built 
up. 



in a complex genome (Fig. 1.4). Since the radioactive nucleic acid 
is used here to search out and detect complementary sequences 
in the presence of a large amount of non-complementary DNA it 
is often referred to as the probe. 
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Northern blotting 

Southern's technique has been of enormous value, but it was 
thought that it could not be applied directly to the blot-transfer 
of RNAs separated by gel electrophoresis since RNA was found 
not to bind to cellulose nitrate. Alwine et ai (1979), therefore, 
devised a procedure in which RNA bands are blot-transferred 



from the gel onto chemically reactive paper where they are 
bound covalently. The reactive paper is prepared by diazo- 
tization of aminobenzyloxymethyl-paper which itself can be 
prepared from Whatman 540 paper by a series of uncompli- 
cated reactions. Once covalently bound, the RNA is available 
for hybridization with radiolabeled DN'A probes. As before, 
hybridizing bands are located by autoradiography. Alwine's 
method thus extends that of Southern and for this reason it 
has acquired the jargon term 'Northern' blotting! 

Because of the firm covalent binding of the RNA to the 
paper, such blot-transfers are reusable; the probe from previous 
hybridization reactions having been eluted by washing at a 
temperature at which hybrids are not . stable. Although 
originally devised for the transfer of RNA bands, the chemically 
reactive paper is equally effective in binding denatured DNA. 
In fact, small DNA fragments are more efficiently transferred to 
the diazotized paper derivative than to nitrocellulose. Extension 
of the fruitful blot-transfer approach has continued and more 
recently it has been found that RNA bands can indeed be blotted 
directly onto nitrocellulose under appropriate conditions (Tho- 
mas 1980). Because this form of Northern blotting does not 
require the preparation of reactive paper, it has been widely 
adopted. 

Boxing the compass has continued with the arrival of the 
term Western blot. This was sometimes used to describe the 
blotting of gel electrophoresed RNA onto nitrocellulose, as 
just described, but the term is now usually restricted to an appli- 
cation that does not involve nucleic acids, i.e. the detection of 
specific polypeptides by first electrophoresing them in poly- 
acrylamide gels, blotting them onto nitrocellulose to which they 
bind, and then detecting them by reaction with a specific, 
labelled antibody. 

Transformation of £. coli 

Early attempts to achieve transformation of E. coli were un- 
successful and it was generally believed that E. coli was refrac- 
tory to transformation. However, Mandel and Higa (1970) found 
that treatment with CaCU allowed E. coli cells to take up DNA 
from bacteriophage A. A few years later Cohen et ai (1972) 
showed that CaCl 2 -treated E. coli cells are also effective recipi- 
ents for plasmid DNA.. Aimost any strain of E. coli can be 
transformed with plasmid DNA, albeit with varying efficiency, 
whereas until recently it was thought that only recBC mutants 
could be transformed with linear bacterial DNA (Cosloy & Oishi 
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1973). More recently, Hoekstra et ai (1980) have shown that 
recBC cells can be transformed with linear DNA but the 
efficiency is only 10°b of that in otherwise isogenic recBC cells. 
Transformation of recBC cells with linear DNA is onlv poss- 
ible if the cells are rendered recombination proficient by the 
addition of a sbcA or ibcB mutation. The fact that the recBC 
gene product is an exonuclease explains the difference in 
transformation efficiency of circular and linear D\ : A in recBC" 
cells. 

As will be seen from the next chapter, many bacteria contain 
restriction systems which can influence the efficiency of 
transformation. Although the complete function of these 
restriction systems is not known yet, one role they do play is the 
recognition and degradation of foreign DNA. For this reason it 
is usual to use a restrictipniess mutant of £. coii as a trans- 
formable host. 

Since transformation of £. coii is an essential step in many 
cloning experiments it is desirable that it be as efficient as 
possible. Several groups of workers have examined the factors 
affecting the efficiency of transformation. It has been found 
that £. coii cells and plasmid DNA interact productively in an 
environment of calcium ions and low temperature (0-5 3 C) and 
that a subsequent heat shock (37-45 °C) is important, but not 
strictly required. Several other factors, especially the inclusion 
of metal ions in addition to calcium, have been shown to 
stimulate the process. 

Hanahan (1983) has re-examined factors that affect the 
efficiency of transformation, and has devised a set of conditions 
for optimal efficiency (expressed as transformants per pg 
plasmid DNA) applicable to most £. coii K12 strains. Typically, 
efficiencies of 10 7 or 10 8 transformants/pg can be achieved. 
Large DNAs transform less efficiently, on a molar basis, than 
small DNAs. Even with such improved transformation pro- 
cedures certain potential gene cloning experiments requir- 
ing large numbers of clones are not reliable. One approach 
which can be used to circumvent the problem of low transfor- 
mation efficiencies is to package recombinant DNA into virus 
particles in vitro. A particular form of this approach, the use of 
cosmids, is described in detail in Chapter 4. 

The biggest problem that prevents analysis of the trans- 
formation process is that the maximum level of transform- 
ation has not been extended beyond a few percent of the 
survivors, of the CaCl 2 treatment. Weston et ai (1979) found 
that when two separate plasmids are employed simultaneously 
at equally saturating concentrations of DNA the total number of 
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transformants obtained is the same as that when one plasmid 

alone is used. This confirms that every transformable cell is " a:rv 

transformed under these conditions suggesting that only a rrcp 

minor subpopulation of CaCl 2 -treated cells of £. coli are capable 

of taking up and establishing plasmid molecules. This idea is OTH 

supported by the work of Jones *?f al. (1981) who found that 

despite the fact that the transformation frequency increased ^' A '° 

100-fold for a 10-fold increase in growth rate, the maximum ^ 

number of transformable cells never exceeded 1%. Why the ^ ncv 

frequency of transformation is limited to 1% remains a mystery. ' 

The absolute requirement for CaCU for transformation of DNA 

£. coli is probably due to structural alterations in the cell wall kn ° v ' 

which it effects. In almost all Gram-negative bacteria which conv: 

have been transformed, CaCl : treatment is a mandatory step. sec *''' : ' 

Only a few Gram-negative bacteria, e.g. Haemophilus and .Wis- rnu: " 

sena, can be transformed without CaCl 2 treatment. T rou 

jbta: 

proc 

Transformation of other organisms rro;c 

po?- 

Although E. coli often remains the host organism of choice for :u:e ~ 
cloning experiments, many other hosts are now used and with 
them transformation may still be a critical step. In the case of 

Gram-positive bacteria the two most important groups of D ^ 
organisms are Bacillus sp. and actinomycetes. That B. subtilis is 

naturally competent for transformation has been known for a Thi: 

long time and hence the genetics of this organism are fairlv rea*: 

advanced. For this reason B. subtilis is a particularly attractive larc 

alternative prokaryotic cloning host. The significant features of en - 

transformation with this organism are detailed in Chapter 8. Of and 

particular relevance here is that it is possible to transform as 

protoplasts of B. subtilis, a technique which leads to improved d:d« 

transformation frequencies. A similar technique is used to aiw 

transform actinomycetes and recently it has been shown that ! 

the frequency can be increased considerably by first entrap- the 

ping the DNA in liposomes which then fuse with the host DN 

cell membrane. This is one aspect of transformation which ont 

undoubtedly will receive considerable attention in the near acc 

future. lab . 

In later chapters we discuss ways in which cloned DNA can DN 

be introduced into eukaryotic cells. With animal cells there is no ba.- 

great problem as only the membrane has to be crossed. In the 1 

case of yeast, protoplasts are required (Hinnan et al. 1978). With or 

higher plants the strategy which has been considered is either rea 

to package the DNA in a plant virus or use a bacterial plant (at 
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pathogen as the donor. It has also been shown that protoplasts 
prepared from plant cells are competent for transformation. 

OTHER TECHNIQUES 

Two other basic capabilities must be included in the repertoire 
of the gene manipulator. The first of these is DNA sequencing. 
Knowledge of the sequence of a cloned DN'A fragment is a 
prerequisite for planning any substantial manipulation of the 
DNA; for example, a computer search of the sequence for all 
known restriction endonuclease target sites will provide a 
complete and precise restriction map. Importantly, once the 
sequence has been manipulated in vitro, perhaps using in-vitro 
mutagenesis techniques, it is usually desirable to sequence the 
product so as to confirm that the required sequence has been 
obtained, the second capability is gene synthesis. Recent 
progress in this field has been very rapid. The synthesis of DNA 
molecules many hundreds of nucleotides in length is now 
possible. The ability to synthesize polymers up to 20 nucleo- 
tides in length quickly and easily has many important appli- 
cations which are discussed in later chapters. 



DNA sequencing by the Maxam and Gilbert method 

This method for DNA sequencing makes use of chemical 
reagents to bring about base-specific cleavage of the DNA. For 
large-scale sequencing it is now less favoured than the 
enzymatic, 'dideoxy', method. However, it still finds application 
and illustrates principles of polyacryiamide gel electrophoresis 
as applied to sequence determination. Discussion of the 
dideoxy method is deferred until Chapter 4 because it is nearly 
always used in combination with phage M13 cloning vectors. 

In the Maxam and Gilbert method (Maxam & Gilbert 1977), 
the starting point is a defined DNA restriction fragment. The 
DNA strand to be sequenced must be radioactively labelled at 
one end with a 32 P-phosphate group. (A detailed practical 
account of the entire sequencing procedure, including end- 
labelling methods, is available (Maxam & Gilbert 1980)|. This 
DNA can be either in single-stranded or duplex form. The 
base-specific cleavages depend upon the following points. 
1 Chemical reagents have been characterized which alter one 
or two bases in DNA (Table 1.1). These are base-specific 
reactions; for example, dimethyl sulphate methylates guanine 
(at the N7 position). 
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Table 1.1 Reagents for Maxam and Gilbert DNA sequencing. 



Base specificity 


Base reaction 


Altered base 
removal 


Strand 
cleavage 


1. G 


dimethvl sulphate 


pipendine 


piperidine 


2. G + A 


acid 


acid-catalysed 
depurination 


pipendine 


3. T + C 


hydrazine 


pipendine 


piperidine 


4. C 


hydrazine + NaCl 


piperidine 


piperidine 


5. A>C 


NaOH 


pipendine 


piperidine 



2 An altered base can then be removed from the sugar- 
phosphate backbone of DNA (Table 1.1). 

3 The strand is cleaved with piperidine at the sugar residue 
lacking the base. This cleavage is dependent upon the previous 
step. 

When each of the base-specific reagents is used in a limited 
reaction with end-labelled DNA, a nested set of end-labelled 
fragments of different lengths is generated. It is important to 
emphasize that the base-specific reactions are deliberately 
limited to give about one, or a few, cleavages per molecule. This 
is illustrated in Fig. 1.5 where the nested set of fragments 
produced by the G-specific reaction is given as an example. Sets 
of fragments are produced by reacting the DNA with each of the 
reagents separately (Table 1.1). All five reactions (1-5, Table 
1.1) may be performed; the fifth reaction gives redundant 



End-labelled 
DNA 



fe&pffp ApCpQpA 



End-labelled 
DNA fragments 



Fig, 1.5 Chemical cleavage of hypothetical DNA at G residues. A 
nested set of end-labelled DNA fragments is produced by limited 
reaction of an end-labelled DNA with G-specific reagents. Other 
fragments are produced but only the terminal fragments bear the label. 
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information but is confirmatory. These sequencing reactions 
are analysed by running the 4 or 5 samples side-by*side on a 
sequencing gel. 
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Fig. 1.6. Photograph of the Maxam and Gilbert autoradiograph 
(courtesy of N. Warburton). 
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A sequencing gel is a high resolution gel designed to 
fractionate single-stranded (denatured) DNA fragments. on the 
basis of their length. They routinely contain 6% to 20°o 
polyacrvlamide and 7m urea. The urea is a denaturant whose 
function is to minimize DNA secondary structure effects on 
electrophoretic mobility. The gel is run at sufficient power to 
heat up to about 70 : C. This also minimizes DNA secondary- 
structure. The labelled DNA bands obtained after such 
electrophoresis are revealed by autoradiography on large sheets 
of X-ray film. The sequence can then be read directly from the 
sequencing ladders in the adjacent base-specific tracks (Fig. 
1.6). 

The chemical synthesis of genes 

The basic method of gene synthesis is the repetitive formation 
of an ester linkage between an activated phosphoric acid 
function of one nucleotide and the hydroxyl group of another 
nucleoside or nucleotide thus forming the characteristic 
phosphodiester bridge. The major problem is that deoxy- 
ribonucleotides are very reactive molecules having a primary 
and secondary hydroxyl group, a primary amino group 
and a phosphate group. Consequently, blocking and de- 
blocking procedures are required and the chemistry involved 
must not result in scission or alteration of the phosphodiester 
backbone, the furanose rings, the sugar- purine/pyrimidine bond 
or the bases themselves— a tall order indeed! The various 
methods used to synthesize oligonucleotides are given in 
outline form below. The detailed chemistry is beyond the scope 
of this book. The reader wishing more information should 
consult the reviews of Caruthers et ai (1982) and Narang (1983). 

The original method for synthesizing oligonucleotides is 
the so-called phosphodiester approach. This method is shown 
in Fig. 1.7. The 5'-phosphate of one nucleotide, after suitable 
protection of other functional groups, is condensed with the 
3'-hydroxyl of another protected nucleoside or nucleotide using 
dicyclohexylcarbodiimide or an arylsulphonyl chloride. The 
chain can then be elongated by further step-wise conden- 
sations. The problems with the phosphodiester approach are 
long reaction times, rapidly decreasing yields as the chain 
length increases and time-consuming purification procedures. 
Nevertheless, the method was used successfully to synthesize 
a biologically active tRN A gene (Khorana 1979). 

The phosphotriester approach solves some of the problems 
of the phosphodiester method by blocking each internucleotide 
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Fig. 1-7 The phosphodiester method of synthesizing oligonucleotides. 
The various blocking groups shown are: MMTr, monomethoxvtritvl; 
Ac, acetyl; An, anisoyl. 



phosphodiester function during the course of building a defined 
sequence. The basic principle of the method is the use of a 
totally protected mononucleotide containing a fully masked 3'- 
phosphotriester group (Fig. 1.8). Using this methodology Edge 
et ai (1981) synthesized 67 oligonucleotides of chain length 
10-20 and spliced them together to generate a 517bp tnterferon- 
alpha gene. 

More recently the phosphite-triester method has been 
introduced and this makes use of the extreme reactivity of 
phosphite reagents. In this way the two building blocks are 
joined in a few minutes compared with the hours required by 
the phosphotriester approach. The basic principles of the 
method are shown in Fig. 1.9. Although the phosphite-triester 
method will work in solution for the condensation of 3 or 4 
nucleotides, construction of large oligonucleotides requires 
the 3' end of the desired oligonucleotide to be coupled to 
an insoluble support. Fortunately, immobilization simplifies 
manipulative procedures and cuts out time-consuming purifi- 
cation steps following each cycle of condensation. Appropri- 
ately blocked mononucleotides are added sequentially and 
reagents, starting materials and by-products removed by fil- 
tration. At the conclusion of the synthesis the deoxyoligo- 
nucleotide is chemically freed of blocking groups, hvdrolysed 
from the support and purified by electrophoresis or HPLC. By 
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^^Mesitylene sulphonyl tetrazolide 




OCH 2 CH 2 CN 

Fig. 1.8 The phosphotriester method of synthesizing oligonucleotides. 
The letter R or R' represent any one of the four bases; adenine, 
guanine, cytosine or thymine. DMTr represents the dimethoxvmrvi 
blocking group. 
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Z - Methyl 

Fig. 1.9 The phosphite-triester method or' synthesizing oligonucleo- 
tides. 



immobilizing the polymer support carrying the initiating deoxy- 
nucleotide in a column, the filtration steps can be replaced 
by a simple washing procedure and this lends itself to a fully 
automatic synthesis. Automatic gene synthesizers consist simply 
of reagent reservoirs whose contents are added to or removed 
from the immobilized protected oligonucleotide via valves 
controlled by a microcomputer. 



It 



'rCN 



uhesuTng olfgonudeoWes. 
' ifee four bases; adenine, 
reilhts the dimethoxytntyl 



Chapter 2 Cutting and Joining 
DNA Molecules 



CUTTING DNA MOLECULES 

It is worth recalling that prior to 1970 there was simply no 
method available for cutting a duplex DNA molecule into 
discrete fragments. DNA biochemistry was circumscribed by 
this impasse. It became apparent that the related phenomena 
of host-controlled restriction and modification might lead 
towards a solution to the problem when it was discovered 
that restriction involves specific endonudeases. The favourite 
organism of molecular biologists, E. coli K12, was the first to be 
studied in this regard, but turned out to be an unfortunate 
choice. Its endonuclease is perverse in the complexity of its 
behaviour. The breakthrough in 1970 came with the discovery 
in Haemophilus influenzae of an enzyme that behaves more 
simply. Present-day DNA technology is totally dependent upon 
our ability to cut DNA molecules at specific sites, with restric- 
tion endonudeases. An account of host-controlled restriction 
and modification therefore forms the first part of this chapter. 

Host-controlled restriction and modification 

Host-controlled restriction and modification are most readily 
observed when bacteriophages are transferred from one 
bacterial host strain to another. If a stock preparation of phage 
\, for example, is made by growth upon E. coli strain C and this 
stock is then titred upon £. coli C and £. coli K, the titres 
observed on these two strains will differ by several orders of 
magnitude, the titre on E. coli K being the lower. The phage are 
said to be restricted by the second host strain (E. coli K). When 
those phage that do result from the infection of E. coli K are now 
replaced on E. coli K they are no longer restricted; but if they are 
first cycled through £. coli C they are once again restricted when 
plated upon £. coli K (Fig. ll). Thus the efficiency with which 
phage h plates upon a particular host strain depends upon 
the strain on which it was last propagated. This non-heritable 
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Fig. 2.1 Host-controlled restriction and modification of phage A in £. 
coli strain K, analysed by efficiency of plating (E.O.P.). Phage pro- 
pagated by growth on strains K or C (i.e. A.K or K.C) have E.O.P.s on 
the two strains as indicated by arrows. £. coli C has no known 
restriction and modification system. 



change conferred upon the phage by the second host strain 
(£. coli K) that allows it to be replated on that strain without fur- 
ther restriction is called modification. 

The restricted phages adsorb to restrictive hosts and inject 
their DNA normally. When the phage are labelled with 32 P it is 
apparent that their DNA is degraded soon after injection 
(Dussoix & Arber 1962) and the endonuclease that is primarily 
responsible for this degradation is called a restriction endo- 
nuclease or restriction enzyme (Lederberg k Meselson 1964). 
The restrictive host must, of course, protect its own DNA from 
the potentially lethal effects of the restriction endonuclease 
and so its DNA must be appropriately modified. Modification 
involves methylation of certain bases at a very limited num- 
ber of sequences within DNA which constitute the recognition 
sequences for the restriction endonuclease. This explains 
why phage that survive one cycle of growth upon the restric- 
tive host can subsequently reinfect that host efficiently; their 
DNA has been replicated in the presence of the modifying 
methylase and so it, like the host DNA, becomes methylated 
and protected from the restriction system. 
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Chapter 2 

Although phage infection has been chosen as our example to 

illustrate restriction and modification, these processes can occur ^. 

whenever DNA is transferred from one bacterial strain to ^ 
another. Conjugation, transduction, transformation and trans- 

fection are all subject to the constraint of host-controlled ^ e 
restriction. The genes that specify host-controlled restriction 

and modification systems may reside upon the bacterial J " 
chromosome itself or mav be located on a piasmid or prophage 
such as PI. ' 

The restriction endonuclease of E. coli K was the first to be 
isolated and studied in detail. Meselson and Yuan (1968) the', 
achieved this by devising an ingenious assay in which a mu: 
fractionated cell extract was incubated with a mixture of [err 
unmodified and modified phage A DMAs which were differ- 
entially radiolabelled— one with 3 H, the other with 32 P— so [sol 
that they could be distinguished. After- incubation, the DN A Rob 
mixture was analysed by sedimentation through a sucrose -.ail 
gradient where the appearance of degraded unmodified DNA or 
in the presence of undegraded modified DNA indicated the wor 
activity of restriction endonuclease. ' n av 

The enzyme from £. coli K, and the similar one from £. sen 
coli B, were found to have unusual properties. In addition bac* 
to magnesium ions, they require the cofactors ATP and as*'. 
S-adenosyl-methionine, and DNA degradation in vitro is un- 
accompanied by hydrolysis of the ATP in amounts greatly re*: 
exceeding the stoichiometry of DNA breakage (Bickle et ai 
1978). In addition, the enzymes are now known to interact with 
an unmodified recognition sequence in duplex DNA and then No 
surprisingly, to track along the DNA molecule. After travelling 
for a distance corresponding to between 1000 and 5000 Tht 
nucleotides the enzyme cleaves one strand only of the DNA at ror 
an apparently random site, and makes a gap about 75 P 0; 
nucleotides in length by releasing acid-soluble oligonucleo- m0 
tides. There is no evidence that the enzyme is truly catalytic, 1 
and having acted once in this way a second enzyme molecule tirs 
is required to complete the double-strand break (Rosamond 5 P< 
et ai 1979). Enzymes with these properties are now known as ev 
type I restriction endonucleases. Like all restriction endo- H: 
nucleases they recognize specific nucleotide sequences. How- 2 
ever, they are not particularly useful for gene manipulation Ec 
since their cleavage sites are non-specific. Their biochemistry S e: 
still presents many puzzles; for instance, the precise role of s P l 
S-adenosyl-methionine remains unclear. ele 

While these bizarre properties of type I restriction enzymes 3 

were being unravelled, a restriction endonuclease from an 

ff influenzae Rd was discovered (Kelly & Smith 1970, Smith nu 
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k Wilcox 1970) that was to become the prototype of a large 
number of restriction endonucleases— now known as type il 
enzymes — that have none of the unusual properties displayed 
by type I enzymes and which are fundamentally important in 
the manipulation of DNA. The type II enzymes recognize a 
particular target sequence in a duplex DNA molecule and break 
the polynucleotide chains within, or near to, that sequence to 
give rise to discrete DNA fragments of defined length and 
sequence. In fact, the activity of these enzymes is often assayed 
and studied by gel electrophoresis of the DNA fragments which 
they generate (see Fig. 1.1). As expected, digests of small plas- 
mid or viral DNAs give characteristic simple DNA band pat- 
terns. 

Very many type II restriction endonucleases have now been 
isolated from a wide variety of bacteria. In a recent review, 
Roberts (1984) lists 475 enzymes that have been at least par- 
tially characterized, and the number continues to grow as 
more bacterial genera are surveyed for their presence. It is 
worth noting that many so-called restriction endonucleases 
have not formally been shown to correspond with any 
genetically identified restriction and modification system of the 
bacteria from which they have been prepared: it is usually 
assumed that a site-specific endonuclease which is inactive 
upon host DNA and active upon exogenous DNA is, in fact, a 
-estriction endonuclease. 



Nomenclature 

The discovery of a large number of restriction enzymes called 
for a uniform nomenclature. A system based upon the pro- 
posals of Smith and Nathans (1973) has been followed for the 
most part. The proposals were as follows. 

1 The species name of the host organism is identified by the 
first letter of the genus name and the first two letters of the 
specific epithet to form a three-letter abbreviation in italics: for 
example, Escherichia coli = Eco and Haemophilus influenzae = 
Hm. 

2 Strain or type identification is written as a subscript e.g. 
Eco K . In cases where the restriction and modification system is 
genetically specified by a virus or plasmid, the abbreviated 
species name of the host is given and the extrachromosomal 
element is identified by a subscript, e.g. Eco?\, Eco^. 

3 When a particular host strain has several different restriction 
and modification systems, these are identified by Roman 
numerals, thus the systems from H. influenzae strain Rd would 
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Table 2.1 Target sites for some restriction endonudeases. A 
comprehensive list is given in Appendix 2. 



Anabaena variabilis 


Ava I 


C;U/CGvG/G 




Bam HI 


GiGATCC 


Bacillus globigii 


Bgl II 


a;gatct 


Escherichia coli RY13 


Eco RJ 


GiAATTC 


bscneritnia cott Ki4j 


tCO Kil 


t rr(T)rc 


Haemophilus aegyptius 


Hae III 


GGiCC 


Haemophilus gallinarum 


H$al 


GACGC 


Haemophilus haemolyticus 


H/tal 


GCGiC 


Haemophilus influenzae Rd 


Hind II 


_jC\ /Av» 

ct(t)i(gmc 




Hind III 


AiAGCTT 


Haemophilus parainfluenzae 


. Hpal 


gttiaac 




Hpa II 


CiCGG 


Klebsiella pneumoniae 


Kpnl 


GGTACIC 


Moraxella bovis 


Mbol 


iGATC 


Providencia stuartii 


Pstl 


CTCCAiC 


Serratia marcescens 


Sma I 


CCC1GGG 


Streptomyces Stanford 


Sstl 


GAGCTiC 


Xanthomonas malvacearum 


Xma I 


CiCCGGG 



1,4 



Source: Roberts ( 1984). Recognition sequences are written from 5'-* 3', 
only one strand being given, and the point of cleavage is indicated by 
an arrow. Bases written in parentheses signify that either base may 
occupy that position. Where known, the base modified by the 
corresponding specific methylase is indicated by an asterisk. 
A is N°-methyladenine, C is 5-methylcytosine. 

Notes 

1,2. The names of these two enzymes are anomalous. The genes 
specifying the enzymes are borne on two Resistance Transfer Factors 
which have been classified separately. Hence RI and RII. 

3. Hga I is a Type II restriction endonuclease. cleaving as indicated: 

5' GACGCNNNNNi 

3' CTGCGNNNNN NNNNNt 

where N is any nucleotide. 

4. Under certain conditions (low ionic strength, alkaline pH or 50% 
glycerol) the Eco RI specificity is reduced so that only the internal 
tetranucleotide sequence of the canonical hexanucleotide is necessary 
for recognition and cleavage. This is so-called Eco RI* (Rl-star) activity. 
It is inhibited by parachloromercuribenzoate, whereas Eco RI activity 
is insensitive (Tikchonenko et ai 1978). Many other enzymes exhibit 
.star activity, i.e. reduced specificity, under suboptimal conditions. 
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be Hm d I, Hm d II, Hm d III, etc. These Roman numerals should 
not be contused with those in the classification of restriction 
enzymes into type I, etc. 

4 All restriction enzymes have the general name endonuclease 
R, but, in addition, cany the system name, e.g. endonuclease 
R.Hm d III. Similarly, modification enzymes are named methvl- 
ase M followed bv the svstem name. The modification enzvme 
from H. influenzae Rd corresponding to endonuclease R.Hin d [[[ 
is designated methylase M.Hm d IH. 

In practice this system of nomenclature has been simpli- 
fied further: 

(a) subscripts are typographically inconvenient: the whole 
abbreviation is now usually written on the line; 

(b) where the context makes it clear that restriction enzymes 
only are involved, the designation endonuclease R. is omitted. 
This is the system used in Table 2.1, which lists some of the 
more commonly used restriction endonucleases. A more 
extensive list is given in Appendix 2. 

Target sites 

The vast majority of, but not ail, type II restriction 
endonucleases recognize and break DNA within particular 
sequences of tetra-, penta-, hexa- or hepta-nucleotides which 
have an axis of rotational symmetry; for example, Eco Rl cuts at 
the positions indicated by arrows in the sequence 

axis of symmetry 



5-G'AA 
3-CT T 



TTC— 

A A.G — 
« r 



giving rise to termini bearing 5'-phosphate and 3'-hydroxyl 
groups. Such sequences are sometimes said to be palindromic by 
analogy with words that read alike backwards and forwards. 
(However, this term has also been applied to sequences such as 

5— ACCCCA— 
3-TCGCCT- 

which are palindromic within one strand, yet do not have an 
axis of rotational symmetry.) If the sequence is modified by 
methylation so that 6-methyladenine residues are found at one 
or both of the positions indicated by asterisks then the sequence 
is resistant to endonuclease R.Eco Rl. The resistance of the 
half-methylated site protects the bacterial host's own duplex 
DNA from attack immediately after semi-conservative repli- 
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DNA sequence, assuming ail bases are equally frequent. Anv 
particular hexanucleotide target would be expected to occur 
once in every 4° (i.e. 4096) nucleotide pairs. Some enzymes (e.g. 
Sau 3AI) recognize a tetranucleotide sequence that is included 
within the hexanucleotide sequence recognized bv a different 
enzyme (e.g. Bam HI). The cohesive termini produced by these 
enzymes are such that fragments produced by Sau 3AI will 
cohere with those produced by Bam HI. If the fragments are 
then covalently joined, the 'hybrid site' so produced will be 
once again sensitive to Sau 3AI, but may not constitute a target 
for Bam HI; this will depend upon the nucleotides adjacent to 
the original Sau 3A1 site (Fig. 2.3). Several other combinations of 
enzymes have this property. 

From Table 2.1 we can also see that Hind II, the first tvpe II 
enzyme to be discovered, is an example of an enzyme 
recognizing a sequence with some ambiguity; in this case all 
three sequences corresponding to the structure given in Table 
2.1 are substrates. There are also several known examples of 
enzymes from different sources which recognize the same 
target. They are isoschizomers. Some pairs of isoschizomers cut 
their target at different places (e.g. Sma I, Xma I). 

In our discussion of the phenomena of restriction and 
modification of phage A by £. coli K, we saw that methylation 
was the basis of modification in that system. In the wide varietv 
of type II restriction enzymes now known there are some 
curious and useful examples of the influence of methyl groups 
at restriction sites. The enzymes Hpa II and Msp I are 
isoschizomers with the target sequence CCGG. Hpa II will not 
cut the target when it contains 5-methylcytosine as indicated by 
the asterisk CCGG (indeed this is the product of M. Hpa II). 
However, Msp I is known to be indifferent to methylation at 
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Fig. 2.3 Production of a hybrid site by cohesion of complementary 
sticky ends generated by Sau 3A and Bam HI. 
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this nucleotide: it cleaves whether or not this C residue 
is methylated. Xow it has been found that over 90% or the 
methyl groups in genomic DNA or many animals, including 
vertebrates and echinoderms, occur as 5-methylcytosine in the 
sequence CG. Many of these methyl groups occur at Msp I sites, 
and their presence can be detected by comparing digests of the 
DNA generated by Hpa II and Msp I. Indeed methylation at Msp 
I sites around a single gene can be investigated in detail by 
combining the use of these two enzymes with the Southern- 
blot technique (Bird & Southern 1978, Razin k Riggs 1980). 

Recently a new kind of type II restriction endonuclease has 
been identified. Enzymes of this type make breaks in the two 
strands at measured distances to one side of their asymmetric 
target sequence (e.g. Hga I; see Table 2.1). Finally, type III re- 
striction enzymes have been classified as those which cleave 
DNA at well-defined sites, require ATP and Mg : ~, but which 
have only a partial requirement for (i.e. are stimulated by) 
S-adenosylmethionine. In these respects they have properties 
intermediate between type I and type II enzymes (Table 2.2). 

The wide variety of properties exhibited by restriction 
endonucleases described in the preceding paragraphs has 
provided great scope for ingenious and resourceful gene 
manipulators. This will be apparent from examples in following 
chapters. 

What is the function of restriction endonucleases in vivo? 
Clearly host-controlled restriction acts as a mechanism by 
which bacteria distinguish self from non-self. It is analogous to 
an immunity system. Restriction is moderately effective in 
preventing infection by some bacteriophages. It may be for this 
reason that the T-even phages (T2, T4 and T6) have evolved 
with glucosylated hydroxymethylcytosine residues replacing 
cytosine in their DNA, so rendering it resistant to many 
restriction endonucleases. The restriction and glucosylation 
modification of T-even phage DNA is beyond the scope of this 
book. For a detailed discussion the reader is referred to 
Kornberg (1980). However, it is worth noting that a mutant 
strain of T4 is available which does have cytosine residues in its 
DNA and is therefore amenable to conventional restriction 
methodology (Velten et ai 1976, Murray et ai 1979, Krisch & 
Selzer 1981). As an alternative to the unusual DNA structure of 
the T-even phages, other mechanisms appear to have evolved 
in T3 and T7 for overcoming restriction in vivo (Spoerel et ai 
1979). In spite of this evidence we may be mistaken in 
concluding that immunity to phage infection is the sole or main 
function of restriction endonucleases in nature. 
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Mechanical shearing of DNA 

In addition to digesting DNA with restriction endonucleases to 
produce discrete fragments, there are a variety of treatment5 
which result in non-specific breakage. Non-specific endo- 
nucleases and chemical degradation can be used but the only 
method that has been much applied to gene manipulation 
involves mechanical shearing. 

The long, thin threads which constitute duplex DNA 
molecules are sufficiently rigid to be very easily broken by 
shear forces in solution. Intense sonication with ultrasound can 
reduce the length to about 300 nucleotide pairs. More controlled 
shearing can be achieved by high-speed stirring in a blender. 
Typicallv, high mot. wt DNA is sheared to a population ot 
molecules with a mean size of about 8 kb pairs by stirring at 
1500 rev/min for 30 min (Wensink et al 1974). Breakage occurs 
essentially at random with respect to DNA sequence. The ter- 
mini consist of short single-stranded regions which may have to 
be taken into account in subsequent joining procedures. 

JOINING DNA MOLECULES 

Having described the methods available for cutting DNA mol- 
ecules we must consider the ways in which DNA fragments can 
be joined to create artificially recombinant molecules. There are 
currently three methods for joining DNA fragments in vitro. 
The first of these capitalizes on the ability of DNA ligase to join 
covalently the annealed cohesive ends produced by certain 
restriction enzvmes. The second depends upon the ability ot 
DNA ligase from phage T4-infected £. coli to catalyse the for- 
mation of phosphodiester bonds between blunt-ended frag- 
ments. The third utilizes the enzyme terminal deoxynucleotidvl- 
transferase to synthesize homopolymeric 3' -single-stranded 
tails at the ends of fragments. We can now look at these three 
methods a little more deeply. 



DNA ligase 

£. coli and phage T4 encode an enzyme, DNA ligase, which 
seals single-stranded nicks between adjacent nucleotides in a 
duplex DNA chain (Oiivera et al. 1968, Gumport & Lehman 
1971). Although the reactions catalysed by the enzymes of E. coli 
and T4-infected E. coli are very similar, they differ in their 
cofactor requirements. The T4 enzyme requires ATP, whilst the 
£ coli enzvme requires NAD*. In each case the cofactor is split 
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Fig. 2.4. Action of DN A ligase. An enzyme- AMP complex binds to a 
nick bearing 3'-OH and 5 -P groups' The AMP reacts with the 
phosphate group. Attack by the 3' -OH group on this moiety generates 
a new phosphodiester bond which seals the nick. 

and forms an enzyme-AMP complex. The complex binds to the 
nick, which must expose a 5'-phosphate and 3'-OH group, and 
makes a covalent bond in the phosphodiester chain as shown in 
Fig. 2.4. 

When termini created by a restriction endonudease which 
creates cohesive ends associate, the joint has nicks a few base 
pairs apart in opposite strands. DNA ligase can then repair 
these nicks to form an intact duplex. This reaction, performed in 
vitro with purified DNA ligase, is fundamental to many gene 
manipulation procedures, such as that shown in Fig. 2.5. 

The optimum temperature for ligation of nicked DNA is 
37 °C, but at this temperature the hydrogen- bonded joint 
between the sticky ends is unstable. Eco Rl-generated termini 
associate through only four AT base pairs and these are not 
sufficient to resist thermal disruption at such a high tem- 
perature. The optimum temperature for ligating the cohesive 
termini is therefore a compromise between the rate of enzyme 
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Fig. 2.5 Use of DNA ligase to create a covalent DNA recombinant 
joined through association of termini generated by Eco RI. 

action and association of the termini, and has been found by 
experiment to be in the range 4-15 X (Dugaicyzk et ai 1975, 
Ferretti k Sgaramella 1981). 

The ligation reaction can be performed so as to favour 
the formation of recombinants. First, the population of re- 
combinants can be increased by performing the reaction at a 
high DNA concentration; in dilute solutions circularizatwn of 
linear fragments is relatively favoured because of the reduced 
frequencv of intermodular reactions. Second, by treating 
linearized plasmid vector DNA with alkaline phosphatase to 
remove 5' -terminal phosphate groups, both recircularization 
and plasmid dimer formation are prevented (Fig. 2.6). In this 
case, circulanzation of the vector can occur only by insertion of 
non-phosphatase-treated foreign DNA which provides one 5'- 
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FiK 2.6 Application of alkaline phosphatase treatment to prevent 
rearcularUa.ion of vector plasmid without insertion ot fore.gn DNA. 

terminal phosphate at each join. One nick at each join remains 
unligated, but after transformation of host bactena, cellular 
repair mechanisms reconstitute the intact duplex. 

Joining DNA fragments with cohesive ends by DNA hgase 
is a relatively efficient process which has been used extensively 
to create artificial recombinants. A modification of this 
procedure depends upon the ability of T4 DNA ligase to ,oin 
b unt-ended DNA molecules (Sgaramella 1972). The E. col, DNA 
liease will not catalyse blunt ligation except under special 
reaction conditions of macromolecular crowding (Zimmerman 
er al 1983). Blunt ligation is most usefully applied to pining 
blunt-ended fragments via linker molecules; for example. 
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Scheller et al. (1977) have synthesized self-complementary 
decameric oligonucleotides which contain sites tor one or more 
restriction endonucleases. One such molecule is shown in Fig. 
2.7. The molecule can be ligated to both ends of the foreign 
DNA to be cloned, and then treated with restriction endo- 
nuclease to produce a sticky-ended fragment which can be 
incorporated into a vector molecule that has been cut with the 
same restriction endonuclease. Insertion by means of the linker 
creates restriction enzyme target sites at each end of the foreign 
DNA and so enables the foreign DNA to be excised and 
recovered after cloning and amplification in the host bacterium. 
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Fig. 2.7 A decameric linker molecule containing an Eco RI target site is 
joined by T4 DNA ligase to both ends of flush-ended foreign DNA. 
Cohesive ends are then generated by Eco RI. This DNA can then be 
incorporated into a vector that has been treated with the same 
restriction endonuclease. 
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Fie. 2.8 Double-linkers. mRNA is cop.ed into double-stranded cDNA 
nd Sal I-linkers <S) are added. The hairpin loop forme by *K- 
priming of second strand synthesis is removed by nuclease SI. In this 
Ziure. any raggedness let, by nuclease SI (i.e. short 
projections at the terminusl is removed by polishing with Klenow 
P po Imerase. Eco Rl-linkers are then Hgated to the ^duplex mc and 
cohesive termini revealed by restrict.on w.th Sal I and , E« RL The 
cDNA plus linkers is then ligated into a vector cut with the same two 
enzvmes. 
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Double-linkers 

Plasmid vectors have been derived which contain a set of 
closelv clustered cloning sites. An example ot such a vector is 
oUCS which is described in more detail in Appendix 6 This _°r- 
vector has been used to clone duplex cDNA molecules by the 
double-linker approach (Kurtz & Nicodemus 1981 Heltman <t 
al 1983) in which different linker molecules are added to the 
opposite ends of the cDNA (Fig. 2.8). Th.s has the following 

advantages. • ; ■ 

1 The problem of vector reclosure without insertion ot toreign 
DNA is overcome. Partlv for this reason the method is etficient, 

1 e . cDNAs have been cloned which were derived trom rare 
mRNA molecules in the starting population. 

2 The use of linkers, rather than homopolymers, is desirable 
when expression from a vector-borne promoter is sought (see 

Chapter?). .. 

3 The orientation of the inserted DNA is fixed. 

Adaptors 

It mav be the case that the restriction enzyme used to generate 
the cohesive ends in the linker will also cut the foreign DNA at 
internal sites. In this situation the foreign DNA will be cloned 
as two or more subfragments. One solution to this problem is to 
choose another restriction enzyme, but there may not be a 
suitable choice if the foreign DNA is large and has sites for F , 5 . 
several restriction enzvmes. Another solution is to methylate 
.nternal restriction sites with the appropriate edification 
methvlase. An example of this is described in Chapter * 
Alternatively, a general solution to the problem is ; provided by 
chemically synthesized adaptor molecules which have a 
preformed cohesive end (Wu et al. 1978). Consider a blunt-ended Hom 
foreign DNA containing an internal Bam HI site (Fig. -.9). 
which is to be cloned in a Bam Hi-cut vector. The Bam adaptor 
molecule has one blunt end. bearing a S'-phosphate : group and 
a Bam cohesive end which is not phosphorylated. The adaptor 
can be ligated to the foreign DNA ends, without any risk of 
adaptor felf- P olymerizat,on. The foreign DNA plus added 
adaptors is then phosphorylated at the d •termini and ligated 
nto the Bam HI site of the vector. If the foreign DNA were to b 
ecovered from the recombinant with Bam HI, it would be 
obtamed in two fragments. However the adaptor is designed o 
contain two other restriction sites (Sma I. Hpa II) which may 
enable the foreign DNA to be recovered intact. 
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Fig. 2.9 Use of a Sum HI adaptor molecule. A synthetic adaptor 
molecule is ligated to the foreign DNA. The adaptor is used in the 
5 -hydroxyi form to prevent self-polymerization. The foreign DNA 
plus ligated adaptors is phosphorylated at. the 5'-termini and ligated 
into the vector previously cut with Bam HI. 



Homopolymer tailing 

A general method for joining DNA molecules makes use of the 
annealing of complementary homopolymer sequences. Thus, 
by adding oligo (dA) sequences to the 3' ends of one population 
of DNA molecules, and oligo (dT) blocks to the 3' ends of 
another population, the two types of molecule can anneal to 
form mixed dimeric circles (Fig, 2.10). 

An enzyme purified from calf-thymus, terminal deoxy- 
nucleotidyl-transferase, provides the means by which the 
homopolymeric extensions can be synthesized, for if presented 
with a single deoxynudeotide triphosphate it will repeatedly 
add nucleotides to the 3 -OH termini of a population of DNA 
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Fig. 2.10 Use of caif-thymus terminal deoxynucieotidyl-transferase to 
add complementary homopolymer tails to two DNA molecules. 



molecules (Chang k Bollum 1971). DNA with exposed 3' -OH 
groups, such as arise from pretreatment with phage K exo- 
nuclease or restriction with an enzyme such as Pst I, is a very 
good substrate for the transferase. However, conditions have 
been found in which the enzyme will extend even the shielded 
3'*OH of 5'-cohesive termini generated by Eco RI (Roychoud- 
hury et ai 1976, Humphries et ai 1978). 

The terminal transferase reactions have been recently charac- 
terized in detail with regard to their use in gene manipu- 
lation (Deng k Wu 1981, Michelson k Orkin 1982). Typically, 
10-40 homopolymeric residues are added to each end. 

In 1972, Jackson et at. were among the first to apply the 
homopolymer method when they constructed a recombinant in 
which a fragment of phage \ DNA was inserted into 5V40 DNA. 
In their experiments, the single-stranded gaps which remained 
in the two strands at each join were repaired in vitro with DNA 
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polymerase and DNA ligase so as to produce covalentlv closed 
circular molecules, which were then used to transtect sus- 
ceptible mammalian cells (see Chapter 11). 

Subsequently, the homopolymer method, employing either 
dA.dT or dG.dC homopolymers, has been applied extensively 
in constructing recombinant plasmids tor cloning in E. coli. 
Commonly, the annealed circles are used directly for trans- 
formation with repair of the gaps occurring in vivo. In the 
example which follows we shall see how homopolymer tailing 
can be applied to cloning DNA copies of eukaryotic messenger 
RNA and how a careful choice of which homopolymers are used 
can be important. 

Cloning cDNA by homopolymer tailing 

If we wish to construct a clone containing sequences derived 
from eukaryotic mRNA, we must first obtain the sequence in 
DNA form. We can do this by making a complementary (cDN A) 
copy of the mRNA, using the enzyme reverse transcriptase, 
which is a type of DNA polymerase found in retroviruses, and 
whose function is to synthesize DNA upon an RNA template. 

Like other true DNA polymerases, reverse transcriptase can 
only synthesize a new DNA strand if provided with a growing 
point in the form of a pre-existing primer which is base-paired 
with the template and bears a free 3' -OH group. Fortunately, 
most eukaryotic mRNAs occur naturally -in a polyadenylated 
form with up to 200 adenylate residues at their 3'-termini and 
hence we can provide a primer simply by hybridizing a short 
oiigo (dT) molecule with this poly (A) sequence. The primer is 
then suitably located for synthesis of a complete cDNA by 
reverse transcriptase in the presence of all four deoxynudeoside 
triphosphates (Fig. 2.11). 

The immediate product of the reaction is an RNA-DNA 
hybrid. The RNA strand can then be destroyed by alkaline 
hydrolysis, to which DNA is resistant, leaving a single- 
stranded cDNA which can be converted into the double- 
stranded form in a second DNA polymerase reaction. This 
reaction depends upon the observation that cDNAs can form a 
transient self-priming structure in which a hairpin loop at the 
3'-terminus is stabilized by enough base-pairing to allow 
initiation of second-strand synthesis. Once initiated, sub- 
sequent synthesis of the second strand stabilizes the hair- 
pin (Efstratiadis et aL 1976, Higuchi et al. 1976). The hairpin 
and any single-stranded DNA at the other end of the cDNA 
molecule are then trimmed away by treatment with the single- 
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Full-length cDNA cloning 

in the previous cDNA cloning scheme second strand synthesis 
is self-primed, resulting in the formation of a duplex cDNA 
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Fig. 2.14 The Okayama and Berg method of cDN A cloning, (a) Prepara- 
tion of plasmid primer and adaptor DNA. The unshaded portion of each 
ring is pBR322 DNA, and the shaded or stippled segments are from SV40 
DNA. 

Fig. 2.13 (facing page) Efficient full-length cDNA cloning (Heidecker & 
Messing 1983). The mRNA is annealed to linearized and oligo-dT tailed 
plasmid DNA, which then primes synthesis of the first cDNA strand. 
Oligo-dG tails are added to the cDNA-plasmid molecules, which are 
then centrifuged through an alkaline sucrose gradient. This step 
removes small molecules, hydrolyses the mRNA and separates the two 
cDNAs which were formerly attached to the same duplex plasmid. 
Denatured, oligo-dC tailed plasmid DNA is added (in excess) and 
conditions adjusted to favour circularization by the complementary 
homopolymer tails. The excess oligo-dC tailed plasmid may simply 
renature, but cannot circularize. The circular molecules have a free 
3'-hydroxyl on the oligo-dC tail which primes second strand synthesis 
of the cDNA to create duplex recombinant plasmids which transform 
£. coli. Clones can be obtained with the cDNA inserted in both 
orientations. 
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dC tailing of single-stranded cDNA followed by oligo-dC 
priming of second strand synthesis (Fig. 2.12) does not lead to 
hairpin formation, nuclease SI treatment is not required, and 
consequently is an effective method for generating full-length 
cDNA clones (Land et al. 1981). Two further methods, shown 
schematically in Figs 2.13 and 2.14, have been devised to 
eliminate the use of nuclease SI. Additionally, in both methods 
the oiigo-dT sequence for priming the first strand cDNA 
synthesis is linked to the vector DNA in a prior reaction. Both 
methods have been reported to promote full length cDNA 
cloning with a very high efficiency (Okayama k Berg 1982, 
Heidecker & Messing 1983). It is thought that full length 
reverse transcripts are obtained preferentially because in each 
case an RNA-DNA hybrid molecule, which is the result of first 
strand synthesis, is the substrate for a terminal transferase 
reaction. A cDNA that does not extend to the end of the mRNA 
will present a shielded 3'-hydroxy group, which is a poor 
substrate for tailing. 



